Short Text Semantic Similarity Measurement Approach Based on Semantic Network

نویسندگان

چکیده

Estimating the semantic similarity between short texts plays an increasingly prominent role in many fields related to text mining and natural language processing applications, especially with large increase volume of textual data that is produced daily. Traditional approaches for calculating degree two texts, based on words they share, do not perform well because similar may be written different terms by employing synonyms. As a result, should semantically compared. In this paper, measurement method presented which combines knowledge-based corpus-based information build network represents relationship compared extracts them. Representing as best knowledge representation comes close human mind's understanding where reflects sentence's semantic, syntactical, structural knowledge. The visual objects, their qualities, relationships. WordNet lexical database has been used source while GloVe pre-trained word embedding vectors have source. proposed was tested using three datasets, DSCS, SICK, MOHLER datasets. A good result obtained RMSE MAE.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Benchmarking short text semantic similarity

Short Text Semantic Similarity measurement is a new and rapidly growing field of research. “Short texts” are typically sentence length but are not required to be grammatically correct. There is great potential for applying these measures in fields such as Information Retrieval, Dialogue Management and Question Answering. A dataset of 65 sentence pairs, with similarity ratings, produced in 2006 ...

متن کامل

Text-to-Text Semantic Similarity for Automatic Short Answer Grading

In this paper, we explore unsupervised techniques for the task of automatic short answer grading. We compare a number of knowledge-based and corpus-based measures of text similarity, evaluate the effect of domain and size on the corpus-based measures, and also introduce a novel technique to improve the performance of the system by integrating automatic feedback from the student answers. Overall...

متن کامل

ECNUCS: Measuring Short Text Semantic Equivalence Using Multiple Similarity Measurements

This paper reports our submissions to the Semantic Textual Similarity (STS) task in ∗SEM Shared Task 2013. We submitted three Support Vector Regression (SVR) systems in core task, using 6 types of similarity measures, i.e., string similarity, number similarity, knowledge-based similarity, corpus-based similarity, syntactic dependency similarity and machine translation similarity. Our third syst...

متن کامل

A Comparative Study of Two Short Text Semantic Similarity Measures

This paper describes a comparative study of STASIS and LSA. These measures of semantic similarity can be applied to short texts for use in Conversational Agents (CAs). CAs are computer programs that interact with humans through natural language dialogue. Business organizations have spent large sums of money in recent years developing them for online customer selfservice, but achievements have b...

متن کامل

English-Persian Plagiarism Detection based on a Semantic Approach

Plagiarism which is defined as “the wrongful appropriation of other writers’ or authors’ works and ideas without citing or informing them” poses a major challenge to knowledge spread publication. Plagiarism has been placed in four categories of direct, paraphrasing (rewriting), translation, and combinatory. This paper addresses translational plagiarism which is sometimes referred to as cross-li...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Baghdad Science Journal

سال: 2022

ISSN: ['2078-8665', '2411-7986']

DOI: https://doi.org/10.21123/bsj.2022.7255